1. INTRODUCTION

Classification is a data mining task in which a computer learns from data through methods and
algorithms that belong to machine learning [1]. Classification involves training and testing data [2]
in the form of text, images, and videos [3]-[5]. The learning process that defines the model may involve
data properties, referred to as data features, so that learning supports identification, decision support,
and method improvement [6]. Classification has become one of the major focuses of computer science.
Data mining algorithms are able to analyze complex datasets and to predict newly inputted data without
making general assumptions about the data distribution. In support of data mining studies, complex data
from various applications have been collected for classification purposes; these collections are referred
to as datasets [7]. A dataset may take the form of text records, literature images, old scriptures, speech
records, business data, pharmaceutical data, or medical images.

Skin cancer is the primary cancer disease in some countries, with a death rate of up to 10% [8].
The number of skin cancer patients increases steadily every year. Skin disease detection and examination
are important to determine the correct treatment. However, classification of cancer images still requires
accurate prediction, which remains a research challenge, including for skin disease classification
algorithms. In addition, low-level features of skin diseases based on shape, texture, and color properties
extracted from the lesion image are still considered to lack clinical meaning [9]. High-level, clinically
oriented features are expected to contribute more by applying various color normalization approaches.
Techniques such as content-based image retrieval [10] and short-term monitoring of lesions are expected
to provide clinically meaningful datasets and to enrich the classification diagnostics. Moreover, if the
rich visual features are complemented by clinical metadata, diagnostic accuracy may increase and
treatment may be more appropriate.

Journal homepage: http://ijai.iaescore.com
ISSN: 2252-8938

Traditional classifiers such as the support vector machine (SVM) are widely used to classify skin
cancer images. For instance, Alquran et al. [9] performed skin disease analysis using SVM with principal
component analysis (PCA) to classify a skin disease dataset. Feature extraction was performed in four
steps (RGB image, binary image, lesion image, and histogram-equalized image) to obtain kurtosis, contrast,
mean, skewness, energy, circulation, standard deviation, homogeneity, and correlation. The features were
then fed into PCA before SVM classification was performed. The five selected features yielded an accuracy
of 92.1%. Meanwhile, Chung et al. [11] used SVM with the help of persistence diagrams from the persistent
homology algorithm (part of topological data analysis (TDA)) for segmenting images. The method also
combines persistent homology with persistence statistics and with Betti and entropy curves before
classification by SVM. The best validation score was achieved when XYZ curves, XYZ stats, and RGB stats
were used as the selected features.

However, traditional classifiers such as SVM, neural networks, and decision trees may have
difficulties in dealing with large datasets and frequent variation in fault patterns. Although these
traditional classifiers may achieve high accuracy for a given dataset, future complex and rich datasets
require more efficient classifiers. The simple evolving connectionist system (SECoS), also called the
evolved multilayer perceptron [12], is the smallest realization of ECoS and is capable of forecasting
big data patterns. This method is powerful in processing rich datasets. SECoS applies the ECoS principle
by developing the multilayer perceptron (MLP) method [13], and it is sometimes referred to as the
evolving multilayer perceptron (eMLP) [14]. SECoS has three layers, namely the input, hidden, and output
layers [15]. The input layer processes raw input data for testing and training. The hidden layer performs
the SECoS calculations by using formulas that have been developed in accordance with the ECoS principle.
The process is carried out in an evolving manner, where the hidden layer expands to reach the specified
value. The output layer produces the classification value [16]-[18].

SECoS is able to perform data classification in various fields [19]. The work in [20] used SECoS to
classify stroke text data to help patient treatment by determining how much fibrinogen should be consumed.
The classification process is very helpful in assisting doctors and patients. In addition, the method has
been used to classify images of Qur'an verses to help readers understand the tajweed rules for reading
the Qur'an [21]. Since the data is not of text type, feature extraction was performed [22], [23]. SECoS
can be integrated into a single image processing module [24], [25], for instance, for isolated phoneme
recognition [26], where large data patterns can be memorized and newly inputted data can be adapted to.
SECoS reduces a large number of hidden nodes. SECoS has also been used for adaptive speech recognition,
business analysis, as well as health and pharmaceutical analysis [27]. This paper identifies skin cancer
from a balanced skin cancer image dataset of malignant and benign skin images.


2. MATERIAL AND METHOD 
2.1. Dataset 

The dataset employed in this paper is based on previous research [28] and is taken from a balanced
skin cancer image dataset of malignant and benign skin images. This skin cancer dataset is available
online on Kaggle. The images were gathered by the International Skin Imaging Collaboration (ISIC) for
study and implementation purposes. There are 360 images: 16.7% fall under the category of malignant
melanomas and 83.3% are in the benign category.

The classification is performed on 224x224 color images. The dataset itself contains 1800 data
items, of which 136 were selected: 100 for training and 36 for testing. The classification targets are
“benign” and “malignant”. In order to shorten the algorithm execution time, each 224x224 color image
was converted into a 28x28 grayscale image.
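
The size reduction described above can be sketched in a few lines. This is only an illustration under
stated assumptions: the paper does not say how the conversion was done, so the luminance weights
(the common 0.299/0.587/0.114 mix) and the 8x8 block averaging are assumptions, and the function names
and the synthetic test image are hypothetical.

```python
def to_grayscale(rgb_image):
    """Convert an H x W image of (r, g, b) tuples (0-255) to luminance values in [0, 1]."""
    # Assumed weights: the widely used 0.299 / 0.587 / 0.114 luminance mix.
    return [
        [(0.299 * r + 0.587 * g + 0.114 * b) / 255.0 for (r, g, b) in row]
        for row in rgb_image
    ]


def downsample(gray, factor):
    """Shrink a grayscale image by averaging non-overlapping factor x factor blocks."""
    height, width = len(gray), len(gray[0])
    if height % factor or width % factor:
        raise ValueError("image size must be a multiple of the factor")
    out = []
    for top in range(0, height, factor):
        out_row = []
        for left in range(0, width, factor):
            block = [gray[y][x]
                     for y in range(top, top + factor)
                     for x in range(left, left + factor)]
            out_row.append(sum(block) / len(block))
        out.append(out_row)
    return out


# Synthetic 224 x 224 RGB image (a horizontal brightness ramp) standing in for a lesion photo.
image = [[(x, x, x) for x in range(224)] for _ in range(224)]
small = downsample(to_grayscale(image), factor=8)  # 224 / 8 = 28
print(len(small), len(small[0]))  # 28 28
```

Each 28x28 result holds 784 values instead of the 150,528 values of a 224x224 RGB image, which is
where the saving in execution time comes from.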


2.2. SECoS architecture 

Figure 1 shows the general architecture of the SECoS algorithm for classifying skin cancer discussed
throughout this paper. The architecture describes the steps involved in classification, such as
grayscaling, feature extraction, data training, and data testing.

Initially, the dataset is split into training and testing data, each containing both benign and
malignant images. These images are then converted into grayscale images. A grayscale image is
represented by a single sample value per pixel that expresses the amount of light. A grayscale image can
be in the form of a black-and-white or gray monochrome image. Black is the weakest light intensity and
white is the strongest. The next step is feature extraction with the histogram of oriented gradients
(HOG). HOG feature extraction determines the frequency of gradient orientations in a portion of the
analyzed image, using contrast normalization. This gradient field captures the spatial structure of the
image. HOG descriptors are commonly used in computer vision and recognition to extract features for
machine learning algorithms. Here, HOG feature extraction extracts the features of the grayscale images
to be used by the SECoS algorithm.
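
To make the HOG step concrete, the following is a deliberately simplified sketch, not the extractor
used in the paper. The cell size (7 pixels), the number of orientation bins (9), and the single global
L2 normalization are assumptions; full HOG implementations normalize over overlapping blocks of cells.
The function name `hog_features` is hypothetical.

```python
import math


def hog_features(gray, cell=7, bins=9):
    """Simplified HOG: one unsigned-orientation histogram per cell, L2-normalized overall."""
    height, width = len(gray), len(gray[0])
    cells_y, cells_x = height // cell, width // cell
    hist = [[[0.0] * bins for _ in range(cells_x)] for _ in range(cells_y)]
    bin_width = 180.0 / bins
    for y in range(cells_y * cell):
        for x in range(cells_x * cell):
            # Central differences, clamped at the image border.
            gx = gray[y][min(x + 1, width - 1)] - gray[y][max(x - 1, 0)]
            gy = gray[min(y + 1, height - 1)][x] - gray[max(y - 1, 0)][x]
            magnitude = math.hypot(gx, gy)
            angle = math.degrees(math.atan2(gy, gx)) % 180.0  # unsigned orientation
            b = min(int(angle / bin_width), bins - 1)
            hist[y // cell][x // cell][b] += magnitude  # magnitude-weighted vote
    flat = [v for row in hist for cell_hist in row for v in cell_hist]
    norm = math.sqrt(sum(v * v for v in flat)) or 1.0
    return [v / norm for v in flat]


# A 28 x 28 image with one vertical edge: all gradient energy is horizontal.
edge = [[0.0] * 14 + [1.0] * 14 for _ in range(28)]
features = hog_features(edge)
print(len(features))  # 4 x 4 cells x 9 bins = 144
```

Because the histogram entries are non-negative and the vector is L2-normalized, every feature lies in
the range 0 to 1, which matches the normalized input range that SECoS expects.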


Figure 1. General architecture (flowchart: input images, grayscaling, HOG feature extraction,
SECoS NN with weight update, testing, results and MAPE)


Data is then trained in SECoS and the weights are updated. If a weight is out of scope, the process
returns to SECoS; if it is correct, the next step is performed. Data testing in SECoS is likewise
performed with weight updating. If the weight is correct, the next step is performed. The MAPE is
calculated by using (1), where the actual value, the classification value, and the number of data are
represented by a, b, and n, respectively.

$$\mathrm{MAPE} = \frac{1}{n} \sum_{i=1}^{n} \frac{|a_i - b_i|}{a_i} \times 100\% \qquad (1)$$
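
As a worked illustration of (1), the sketch below uses the usual definition of the mean absolute
percentage error with a as the actual value and b as the classification value. The numeric class
encoding (1 and 2) is an assumption chosen so that no actual value is zero; the paper does not state
how the two classes were encoded.

```python
def mape(actual, predicted):
    """Mean absolute percentage error, in percent."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    # Each term is |a - b| / |a|; an actual value of zero would be undefined.
    total = sum(abs(a - b) / abs(a) for a, b in zip(actual, predicted))
    return total / len(actual) * 100.0


actual = [1, 2, 2, 1]             # hypothetical encoding: 1 and 2 for the two classes
predicted = [1.0, 1.5, 2.0, 1.2]  # hypothetical classifier outputs
print(mape(actual, predicted))    # approximately 11.25
```

The four terms are 0, 0.25, 0, and 0.2, so the mean is 0.1125, or 11.25%.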


2.3. SECoS development on normalized Euclidean distance

SECoS has a neural network that extends to the hidden layer and the output layer, as described in
Figure 2(a). Figure 2(a) shows three layers of neurons. The first layer is the input layer, followed by
the evolving (constructive) layer as the second layer and the output layer as the final stage. The first
layer is connected to every node of the second layer. The second layer adds neurons and carries out
learning. The evolving-layer neurons are activated according to the Euclidean distance between the
SECoS input and the evolving layer. Finally, the results go to the output layer. SECoS is a fully
connected network in which the number of connections from the input layer to each evolving-layer node
equals the number of input neurons. By using the tested patient data of the -455 G/A polymorphism of the
beta fibrinogen gene, the normalized Euclidean distance has the smallest MAPE compared to the Hamming
and Manhattan distances, as shown in Figure 2(b). Although the difference is lower than 1.8%, which
means that the distance formula has an insignificant effect on the accuracy of the training results, the
normalized Euclidean distance is employed in this research. The Euclidean distance can therefore be
calculated directly. Since the distance lies in the range [0, 1], the input data should be
normalized [29].

Figure 2. SECoS neural network and distance comparison [30]


In general, feedforward networks have an activation function. SECoS uses a linear activation
function: the activation A of evolving-layer node n is determined by (2).


$$A_n = 1 - D_n \qquad (2)$$


$A_n$ is the activation of node n and $D_n$ is the normalized distance between the input vector and
the incoming connection weights of that node. Generally, $D_n$ is a normalized Hamming distance, but this
paper follows [31], which optimized the method by replacing the normalized Hamming distance with the
normalized Euclidean distance, giving better learning results. The normalized Euclidean distance is given
by (3), where K is the number of input nodes of the SECoS network, I is the input vector, and W is the
weight matrix of the evolving layer.


$$D_n = \frac{\sqrt{\sum_{i=1}^{K} |I_i - W_{i,n}|^2}}{\sqrt{K}} \qquad (3)$$
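
The activation of a single evolving-layer node can be sketched as follows. The distance here is
assumed to be the Euclidean norm of the difference divided by the square root of K, which keeps it in
[0, 1] when inputs and weights are in [0, 1]; the exact normalization is an assumption, and the function
names are hypothetical.

```python
import math


def normalized_euclidean(inputs, weights):
    """Euclidean distance divided by sqrt(K); stays in [0, 1] for values in [0, 1]."""
    k = len(inputs)
    if k == 0 or k != len(weights):
        raise ValueError("vectors must be non-empty and of equal length")
    squared = sum((i - w) ** 2 for i, w in zip(inputs, weights))
    return math.sqrt(squared) / math.sqrt(k)


def activation(inputs, weights):
    """Linear activation of an evolving-layer node: A_n = 1 - D_n."""
    return 1.0 - normalized_euclidean(inputs, weights)


print(activation([0.2, 0.8, 0.5], [0.2, 0.8, 0.5]))  # 1.0 for an exact match
print(activation([0.0, 0.0], [1.0, 1.0]))            # 0.0 for opposite corners
```

An activation near 1 means the input closely matches the node's stored weights, and an activation near
0 means it is as far away as normalized inputs allow.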


Further, learning was performed by analyzing images of the skin cancer dataset. The SECoS learning
process adds a neuron to the evolving layer only if the linear activation is smaller than the sensitivity
threshold or if the calculated error is higher than the error threshold. The incoming and outgoing
connection weights of the new neuron are set from the input and output vectors. Otherwise, the connection
weights are adjusted so that the winning neuron approximates the input vector. The SECoS procedure is
outlined in [30]. The input vector I is fed forward through the network. If the maximum activation
$A_{max}$ among the nodes is smaller than the sensitivity threshold ($S_{thr}$), a new node is added to
the hidden layer. Otherwise, SECoS calculates the difference between the calculated output vector $O_c$
and the desired output vector $O_d$. If the difference is higher than the error threshold ($E_{thr}$), or
if the expected output node is unavailable, SECoS adds a new node to the hidden layer. Otherwise, the
winning node of the hidden layer updates its connection weights. The aforementioned steps are performed
once for each input, without epochs.
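
The node-insertion decision described above can be sketched as follows. This is a minimal
illustration, not the authors' implementation: the function names `decide` and `train`, the use of the
largest absolute output difference as the error, and the square-root-of-K normalization of the distance
are assumptions. The weight adjustment of the winning node is left out to keep the sketch focused on
how the hidden layer grows.

```python
import math


def activation(x, w):
    """A = 1 - normalized Euclidean distance (assumed form: divide by sqrt(K))."""
    return 1.0 - math.sqrt(sum((a - b) ** 2 for a, b in zip(x, w)) / len(x))


def decide(x, target, w_in, w_out, sens_thr=0.5, err_thr=0.1):
    """Return ("add", None) or ("update", winner_index) for one training example.

    w_in / w_out hold one incoming / outgoing weight vector per hidden node.
    """
    if not w_in:
        return "add", None  # empty network: the first example always becomes a node
    acts = [activation(x, w) for w in w_in]
    winner = max(range(len(acts)), key=acts.__getitem__)
    a_max = acts[winner]
    if a_max < sens_thr:
        return "add", None  # no existing node is similar enough to the input
    # One-of-N propagation: only the winning node drives the output layer.
    output = [a_max * w for w in w_out[winner]]
    error = max(abs(d - o) for d, o in zip(target, output))
    if error > err_thr:
        return "add", None  # similar input, but the output is too far off
    return "update", winner


def train(examples, sens_thr=0.5, err_thr=0.1):
    """Single pass over the data (no epochs); grows the hidden layer as needed."""
    w_in, w_out = [], []
    for x, target in examples:
        action, _ = decide(x, target, w_in, w_out, sens_thr, err_thr)
        if action == "add":
            w_in.append(list(x))        # incoming weights copy the input vector
            w_out.append(list(target))  # outgoing weights copy the desired output
        # For "update", the winner's weights would be adjusted; omitted in this sketch.
    return w_in, w_out


examples = [
    ([0.10, 0.10], [1.0, 0.0]),  # first example: always added
    ([0.12, 0.10], [1.0, 0.0]),  # close to node 0 with a small error: update
    ([0.90, 0.90], [0.0, 1.0]),  # far from node 0: added
]
w_in, w_out = train(examples)
print(len(w_in))  # 2
```

The number of hidden nodes is therefore an outcome of training rather than a setting, which is why it
varies with the thresholds.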

When a node is added, its incoming connection weights are set to the input vector I, and its
outgoing weights are set to the desired output vector $O_d$. Feedforward propagation from the hidden
layer to the output layer can be performed in two different ways. The first is One-of-N propagation, in
which only the hidden node with the maximum activation propagates to the output layer. The incoming
weights of the winning node j are adjusted by using (4).


$$W_{i,j}(t+1) = W_{i,j}(t) + \eta_1 \left( I_i - W_{i,j}(t) \right) \qquad (4)$$

Here $W_{i,j}(t)$ is the incoming weight from input i to node j at time t, and $W_{i,j}(t+1)$ is that
weight at time t+1. Meanwhile, $\eta_1$ is learning rate 1 and $I_i$ is the i-th component of the input
vector I. The outgoing weights of node j are adjusted based on (5).

$$W_{j,p}(t+1) = W_{j,p}(t) + \eta_2 \left( A_j \times E_p \right) \qquad (5)$$

where $W_{j,p}(t)$ is the outgoing weight from node j to output p at time t, $W_{j,p}(t+1)$ is that
weight at time t+1, $\eta_2$ is learning rate 2, $A_j$ is the activation of node j, and $E_p$ is the
error at output node p.
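
A small numeric example may help with the two update rules. It assumes they take the usual SECoS
form: the incoming weights move toward the input by a fraction given by learning rate 1, and the
outgoing weights change by learning rate 2 times the node activation times the output error. The function
names and all numbers below are hypothetical.

```python
def update_incoming(w_in, x, lr1):
    """Rule (4): move the winner's incoming weights toward the input vector."""
    return [w + lr1 * (xi - w) for w, xi in zip(w_in, x)]


def update_outgoing(w_out, a_j, errors, lr2):
    """Rule (5): adjust the winner's outgoing weights by lr2 * A_j * E_p."""
    return [w + lr2 * a_j * e for w, e in zip(w_out, errors)]


new_w_in = update_incoming([0.2, 0.6], x=[0.4, 0.2], lr1=0.3)
new_w_out = update_outgoing([1.0, 0.0], a_j=0.9, errors=[0.1, -0.05], lr2=0.3)
print(new_w_in)   # approximately [0.26, 0.48]
print(new_w_out)  # approximately [1.027, -0.0135]
```

With learning rate 1 equal to 0.3, each incoming weight covers 30% of its gap to the input, for
example 0.2 + 0.3 x (0.4 - 0.2) = 0.26. A larger rate makes the node follow recent inputs more closely.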


3. RESULTS AND DISCUSSIONS 

At this stage, the skin cancer classification for patients with malignant and benign moles has
been carried out using the modified SECoS method. The initial process includes collecting a dataset of
images of malignant and benign skin moles. Figure 3 shows the process of grayscale image conversion and
HOG feature extraction for malignant and benign skin moles before they are fed into the SECoS
classifier. Each dataset image is converted to grayscale, that is, to monochromatic shades from black to
white, so that the image contains only gray levels and no color. Feature extraction was then performed
by using the HOG feature extractor with the help of tools written in Python.

Figure 3. HOG feature extraction


Figure 4 shows samples of malignant and benign skin moles and the HOG of those images. Figures 4(a)
and 4(c) show examples of malignant and benign skin mole images used in the experiment. These images
show the skin lesions under study. The images are then converted into HOG images as shown in
Figures 4(b) and 4(d), which exhibit more uniform patterns. The HOG images used as SECoS input have
normalized values in the range of 0 to 1. In the testing stage, the process is similar to the
calculation in SECoS. First, the parameter values are set for both learning rates, the sensitivity and
error thresholds, and the amount of training data. Then propagation is performed from the input layer to
the hidden layer by determining the activation values, including the normalized Euclidean distance.
Propagation from the hidden layer to the output layer is performed afterward. The MAPE value is
calculated from the training data based on the pattern that has been obtained. Both the training and
testing processes in this paper use 136 randomized images of malignant and benign moles. The 100 training
data and 36 testing data are depicted in Figure 5.
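
A randomized split of 136 items into 100 training and 36 testing items could look like the sketch
below. The seed, the identifiers, and the function name `split_dataset` are hypothetical; the paper does
not describe how the randomization was done.

```python
import random


def split_dataset(items, n_train, seed=0):
    """Shuffle a copy of the items and cut it into training and testing parts."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # local generator keeps the split reproducible
    return shuffled[:n_train], shuffled[n_train:]


image_ids = [f"img_{i:03d}" for i in range(136)]  # hypothetical identifiers
train_ids, test_ids = split_dataset(image_ids, n_train=100)
print(len(train_ids), len(test_ids))  # 100 36
```

Fixing the seed makes the same split reproducible across runs, which helps when comparing parameter
settings on identical data.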


Figure 4. Sample of skin cancer images and their HOGs, (a) malignant skin moles, (b) HOG of malignant
skin moles, (c) benign skin moles, (d) HOG of benign skin moles


Figure 5 shows the training process. The upper-left panel lists the benign and malignant skin image
data, the lower-left panel lists the normalized data for benign and malignant skin moles in the range of
0 to 1, and the list on the right is the result of propagation from the hidden layer to the output
layer. The output layer also yields the MAPE value calculated from the amount of data being trained.
Figure 6 shows the visual results of the testing data. The MAPE was lowest when learning rate 1 is 0.3,
learning rate 2 is 0.3, the sensitivity threshold is 0.5, and the error threshold is 0.1, producing a
MAPE of 0.5184845, as shown in Figure 6(a). Meanwhile, the number of hidden nodes is 23 at this minimum
MAPE, as depicted in Figure 6(b).

Figure 6. Visual results of skin cancer testing, (a) MAPE vs error threshold and sensitivity threshold,
(b) MAPE vs number of hidden nodes

Table 1 shows the combinations of parameters in the modified SECoS method, where the activation
function involves the normalized Euclidean distance formula. The learning rates vary from 0.3 to 0.9,
the sensitivity threshold from 0.5 to 0.99, and the error threshold from 0.05 to 0.1. The number of
hidden nodes changes dynamically as the SECoS algorithm continuously learns from the input received and
performs adaptive learning.


Table 1. Classification result with modified SECoS 


Learning Rate 1  Learning Rate 2  Sens. Threshold  Error Threshold  MAPE  Hidden Nodes
0.3 0.3 0.5 0.1 0.614285 28 
0.3 0.6 0.5 0.1 0.671923 33 
0.3 0.9 0.5 0.1 0.645498 37 
0.6 0.3 0.5 0.1 0.585737 40 
0.6 0.6 0.5 0.1 0.622073 45 
0.6 0.9 0.5 0.1 0.611276 47 
0.9 0.3 0.5 0.1 0.599489 45 
0.9 0.6 0.5 0.1 0.595727 49 
0.9 0.9 0.5 0.1 0.616414 51 
0.3 0.3 0.5 0.05 0.649967 32 
0.3 0.6 0.5 0.05 0.686454 40 
0.3 0.9 0.5 0.05 0.676481 41 
0.6 0.3 0.5 0.05 0.627259 45 
0.6 0.6 0.5 0.05 0.649168 48 
0.6 0.9 0.5 0.05 0.615139 48 
0.9 0.3 0.5 0.05 0.614936 48 
0.9 0.6 0.5 0.05 0.636954 54 
0.9 0.9 0.5 0.05 0.599240 54 
0.3 0.3 0.5 0.1 0.518484 23 
0.3 0.6 0.5 0.1 0.592122 32 
0.3 0.9 0.5 0.1 0.564593 32 
0.6 0.3 0.99 0.1 0.527925 24 
0.6 0.6 0.99 0.1 0.589566 32 
0.6 0.9 0.99 0.1 0.564456 33 
0.9 0.3 0.99 0.1 0.539383 24 
0.9 0.6 0.99 0.1 0.582916 32 
0.9 0.9 0.99 0.1 0.557999 34 
0.3 0.3 0.99 0.05 0.567959 28 
0.3 0.6 0.99 0.05 0.585832 33 
0.3 0.9 0.99 0.05 0.579936 34 
0.6 0.3 0.99 0.05 0.556921 28 
0.6 0.6 0.99 0.05 0.597425 33 
0.6 0.9 0.99 0.05 0.577321 36 
0.9 0.3 0.99 0.05 0.550236 28 
0.9 0.6 0.99 0.05 0.591618 35 
0.9 0.9 0.99 0.05 0.556443 37 
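
The 36 rows of Table 1 correspond to a full grid over the parameter values named in the text. The
sketch below enumerates such a grid and picks the row with the lowest MAPE. The value lists are read off
the table, the three sample rows are copied as printed, and the variable names are hypothetical. The
code does not rerun the experiment.

```python
from itertools import product

learning_rates_1 = [0.3, 0.6, 0.9]
learning_rates_2 = [0.3, 0.6, 0.9]
sens_thresholds = [0.5, 0.99]
error_thresholds = [0.1, 0.05]

# Every combination of the four parameters: 2 x 2 x 3 x 3 settings.
grid = list(product(sens_thresholds, error_thresholds, learning_rates_1, learning_rates_2))
print(len(grid))  # 36

# Sample rows as printed in Table 1: (lr1, lr2, sens_thr, err_thr, MAPE, hidden_nodes).
rows = [
    (0.3, 0.3, 0.5, 0.1, 0.614285, 28),
    (0.6, 0.3, 0.99, 0.1, 0.527925, 24),
    (0.3, 0.3, 0.5, 0.1, 0.518484, 23),
]
best = min(rows, key=lambda row: row[4])  # lowest MAPE wins
print(best)  # (0.3, 0.3, 0.5, 0.1, 0.518484, 23)
```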


SECoS performs better than other methods such as the probabilistic neural network (PNN) and the
support vector machine (SVM) in terms of false negatives, as SECoS is able to learn and adapt to input
patterns continuously. Even though its true positive result is lower than that of SVM, SECoS is more
robust in dealing with changes in error patterns, as shown in Figure 7 [31].

Figure 7. SECoS and other classifiers comparison


4. CONCLUSION

In summary, this paper has discussed the classification of skin cancer from an image dataset of
malignant and benign skin images by using the SECoS algorithm. In order to prepare the classifier input,
grayscaling and HOG feature extraction were performed. The normalized Euclidean distance was employed
as its performance is better than that of the Hamming and Manhattan distances. The modified SECoS works
well in continuously learning from the received input, and it changes the number of hidden nodes
adaptively. The evaluation of skin cancer classification was carried out on 136 data items, with 100 for
training and 36 for testing. Among the parameter combinations, the smallest error was obtained with
learning rate 1 of 0.3, learning rate 2 of 0.3, a sensitivity threshold of 0.5, and an error threshold
of 0.1. The minimum MAPE obtained is 0.5184845, with 23 hidden nodes. Even though the true positive
result of SECoS is lower than that of SVM, its false negative result is much better, which indicates
that SECoS is more robust in handling frequent input pattern variations.